Hierarchical Modular Optimization of Convolutional Networks Achieves Representations Similar to Macaque IT and Human Ventral Stream
نویسندگان
چکیده
Humans recognize visually-presented objects rapidly and accurately. To understand this ability, we seek to construct models of the ventral stream, the series of cortical areas thought to subserve object recognition. One tool to assess the quality of a model of the ventral stream is the Representational Dissimilarity Matrix (RDM), which uses a set of visual stimuli and measures the distances produced in either the brain (i.e. fMRI voxel responses, neural firing rates) or in models (features). Previous work has shown that all known models of the ventral stream fail to capture the RDM pattern observed in either IT cortex, the highest ventral area, or in the human ventral stream. In this work, we construct models of the ventral stream using a novel optimization procedure for category-level object recognition problems, and produce RDMs resembling both macaque IT and human ventral stream. The model, while novel in the optimization procedure, further develops a long-standing functional hypothesis that the ventral visual stream is a hierarchically arranged series of processing stages optimized for visual object recognition.
منابع مشابه
Short-latency category specific neural responses to human faces in macaque inferotemporal cortex
In this article I would present evidence to show that timing of the flow of neural signals within the ventral visual stream is a crucial part of the neural code for categorization of faces. We recorded the activity of 554 inferotemporal neurons from two macaque monkeys performing a fixation task. More than 1000 object images including human and non-primate animal faces were presented up to 10 t...
متن کاملShort-latency category specific neural responses to human faces in macaque inferotemporal cortex
In this article I would present evidence to show that timing of the flow of neural signals within the ventral visual stream is a crucial part of the neural code for categorization of faces. We recorded the activity of 554 inferotemporal neurons from two macaque monkeys performing a fixation task. More than 1000 object images including human and non-primate animal faces were presented up to 10 t...
متن کاملA Modified Grasshopper Optimization Algorithm Combined with CNN for Content Based Image Retrieval
Nowadays, with huge progress in digital imaging, new image processing methods are needed to manage digital images stored on disks. Image retrieval has been one of the most challengeable fields in digital image processing which means searching in a big database in order to represent similar images to the query image. Although many efficient researches have been performed for this topic so far, t...
متن کاملA performance - optimized model of neural responses across the ventral visual stream
12 Human visual object recognition is subserved by a multitude of cortical areas. To make sense 13 of this system, one line of research focused on response properties of primary visual cortex 14 neurons and developed theoretical models of a set of canonical computations such as convolution, 15 thresholding, exponentiating and normalization that could be hierarchically repeated to give 16 rise t...
متن کاملHand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کامل